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(57) Abstract 

The present invention is a method for the isolation and characteri- 
zation of C glutamicum genes involved in amino acid biosynthesis, spe- 
cifically, encoding hom^ thrB^ and thrC, and sequences regulating their 
expression. Techniques for modifying or replacing these sequences and 
means for facilitating further isolations and characterizations, including 
promotor probe vectors which are useful in screening for high efficiency 
and regulatable promoters and repressors, are also disclosed. A C glu- 
tamicum genomic library was constructed by cleaving chromosomal 
DNA with restriction enzymes, inserting the DNA fragments into an ap- 
propriate vector, and transforming the resulting recombinant molecules 
(rDNA) into C. glutamicum. Amino acid biosynthetic genes horn, rhrB, 
and //»/C, encoding homoserine dehydrogenase, homoserine kinase, and 
threonine synthetase, respectively, were isolated by complementation of 
C. glutamicum auxotrophs. The hom-thrB genes were subcloned on a 3.6 
kb Sail generated chromosomal fragment while thrC activity was isolat- 
ed from a second recombinant plasmid within the genomic library and 
subcloned on a 2.7 kb Sph\ generated fragment The hom-thrB and thrC 
loci, and regulatory sequences, were identified by enzyme assays, com- 
plementation of defined £. coli auxotrophs, SI nuclease and deletion 
mapping. 
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C. GLUTAMICUM THREONINE BIOSYNTHETIC PATHWAY 



Background of "the Invention 

The present invention is generally in the field 
of genetic engineering, and specifically, in the 
area of manipulation of amino acid biosynthesis in 
Gram positive bacteria • 

Corynebacterium glut ami cum is a Gram positive, 
nonpathogenic microorganism that has long ocGupied a 
central role in the industrial production of amino 
acids by conventional fermentation processes. Past 
strain development has primarily depended on classi- 
cal mutagenesis to remove competing pathways to 
thereby increase substrate availability, and to 
remove or reduce regulatory control of a particular 
biosynthetic pathway. Regulatory mutants were 
isolated by selecting strains resistant to toxic 
amino acid analogues. The use of chemical muta- 
genesis has been very successful and a number of 
economically viable L-amino acid fermentation 
strains, such as strain . producing L-glutamate and 
L-lysine, have been established. 

The recent development of cloning vectors, 
including those described in U.S, Patent No. 
4,649,119 to Sinskey et al., and methods for DNA 
transformation of glutamicum ^ as decribed by 
Katsumata et al - , J. Bacteriol. 159,306-311 (1984), 
and Yoshihama et al,, J. Bacterid . 162, 591-597 
(1985) , and the closely related Corynebacterium 
(Brevibacterium) lactof ermentum described by 
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Santamaria et al> in J , Gen , Kicr ob iol . 13 0, 

2237-2246 (1984) , initiated a new era in the genetic * 
manipulation of these organisms. 

However, the commercial utilization of C. 
q-lutamicxm recombinant DNA technologies for future 
strain development is dependent on the development 
of additional genetic tools and a better understand- 
ing of the fundamental molecular biology of this 
species. The use of recombinant DMA techniques to 
develop industrial strains would offer several 
advantages over classical mutagenic strategies.. . For 
example, specific alterations such as the replace- 
ment of a low efficiency promoter would be possible, 
the stepwise isolation of enhancing mutations could 
be avoided, regulatory systems could be engineered 
to allow the temporal control of gene expression 
during a fermentation process, and novel genes 
and/ or pathways could be introduced into an 
organism. 

It is therefore an object of the present 
invention to isolate and characterize genes encoding 
components of amino acid biosynthetic pathways in 
Corynebacterium . 

It is another object of the present invention 
to clone the isolated amino acid biosynthetic genes, 
specifically those involved in the threonine bio- ^ 
synthetic pathway. 

It is still another object of the present 
invention to elucidate the structure of these genes ^ 



6^4SDOClD; <Wp_ 880981 9A2. J , ^ 



wo 88/09819 PCT/US88/02029 



3 

and the regulatory mechanisms that modulate their 
expression • 

It is a further object of the present invention 
to characterize and modify the expression of the 
cloned, amino acid biosynthetic genes, as well as 
the primary structure and regulatory features of 
their protein products. 

Summary of the Inventri-on 

The present invention is a method for the 
isolation and characterization of C. qlutamicum 
genes involved in aimino acid biosynthesis, 
specifically, hom, thr B, and thrC, and sequences 
regulating their expression. Techniques for modify- 
ing their expression and regulation are also des- 
cribed. Methods and sequences facilitating further 
isolations and characterizations are also disclosed, 
including promoter probe 'vectors which are useful in 
screening for high efficiency and regulated 
promoters. 

A C. qlutamicum genomic library was constructed 
by cleaving chromosomal DNA with the restriction 
enzyme Mbol, inserting the resultant DNA fragments 
into a C. qlutamicum/Bacillus subtilis shuttle 
vector, pHY416, and transforming the resulting 

recombinant molecules into C. qlutamicum . Amino V 
acid biosynthetic genes hom , thrB, and thr C, encod- 
ing homoserine dehydrogenase, homoserine kinase, and 
threonine synthase, respectively, were isolated by 
complementation of C. qlutamicum auxotrophs. The 
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hom - 'thrB genes were sub cloned on a 3 • 6 kb Sai l 

generated chromosomal fragmeni: while tAr C activity ' 

was isolated from a second recombinant plasmid 

w^ithin the genomic library and siibcloned on a 2.7 kb 

S£hl generated fragment. The hom-thrB and thrC loci 

were identified by a combination of enzyme assays 

and complementation of defined £. coli auxotrophs, 

and amino acid sec[uence homology. 

Enzymatic assay of homoserine dehydrogenase 
activity, encoded by hom, in strains harboring the 
cloned gene demonstrated a 20-fold increase in 
specific activity compared to wild type controls - 
Both the chromosomal and plasmid encoded activities 
are strongly inhibited by L-threonine cind repressed 
by li-methionine. The li-methionine repression of the 
plasmid encoded activity demonstrates that the 
structural gene and sequences responsible for its 
expression are included within the cloned fragment. 
Southern hybridization analysis demonstrated that 
the hom/thr B and thr C loci are separated by a 
minimum of 8.3 }cb in the c . glutamicum chromosome. 
This is a different genomic organization from that 
observed in E. coli where the three genes represent 
a single operon. Three lines of evidence demon- 
strate that the C. glutamicxam hom - thr B genes repre- 
sent an operon. First, they are located together 
(separated by ll base pairs) and coordinately 
regulated by L-methionine. Secondly, Northern 
hybridization analysis has identified a single 2.4 ^ 
kb, L-methionine repressed RNA transcript. 



BNSDOCID: <WO 880981 9A2_t.> 



PCT/US88/02029 



5 

consistent with the size of the two coding regions. 
Finally, deletion of the promoter upstream of the 
hom gene significantly reduces the expression of 
both the hom and thr B genes. 

The hom-thrB and thr C promoters were identified 
by complementation of auxotrophs, deletion analysis 
and SI nuclease mapping. The hom - thr B operator, a 
hyphenated dyad symmetry element, was also identi- 
fied by deletion analysis. Methods for modifica- 
tion, removal or replacement of these regulatory 
elements are described. 

Brief Description of the Drawings 

Figure 1 is a schematic of the threonine 

biosynthetic pathway. 

Figure 2 is a graphic depiction of subcloning 

strategy and restriction maps of recombinant plas- 

mids pFS78, pFSSO, pFS3.6A, pFS3.6B, pSPCl and 

pSPC4. 

Figure 3 is the nucleotide sequence and pre- 
dicted protein sequences of hom and thr B. 

Figure 4 is the nucleotide sequence and pre- 
dicted protein sequence of the thrC gene. 

Figure 5a is the sequence of the C. qlutamicum 
hom - thr B regulatory region indicating the mRNA 
initiation site, -35 and -10 regions of thrPl and 
the hyphenated dyad symmetry element responsible for 
methionine mediated repression ( thr O) . 

Figure 5b is the potential stem/loop structure 
formed by the hyphenated dyad symmetry element. 
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Ficfxire 6 is a schematic of the construction of 
£• glutiamiciam hom-ttirB promoter deletions and subse- 
quent analysis. 

Figure 7: Deletional analysis of the hom - thr B 
promoter region. The 3.6 kb C. crlutamicum chromo- 
somal DITA insert of pFS3.6 carrying the hom-thr B 
genes is indicated as a hatched box and the nucleo- 
tide sequence of the relevant promoter-containing 
Dra l-Hindlll fragment is shown = The extent of Bal31 
generated deletions in various plasmid constructs 
based on vector pWSTl are presented as black bars. 
The start of transcription as determined by SI 
nuclease mapping is indicated by an arrow. 

Detailed Description .of the Invention 

Recombinant DNA technology has been used to 
isolate, characterize and manipulate genes involved 
in the amino acid biosynthetic pathway of Coryne- 
bacterium glutamicum . The technology and results 
obtained aid in the elucidation of the fundamental 
molecular biology of glut ami cum and construction 
of amino acid producing strains, particularly 
threonine. 

Threonine is produced in a series of reactions 
beginning with the reduction of the beta-carboxyl 
group of aspartic acid to from the aldehyde, 
aspartic beta-semialdehyde, which takes place via an 
acyl phosphate intermediate, beta-aspartyl phos- ^ 
phate, in an ATP requiring reaction. Aspartic 



/ 
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beta-semialdehyde is converted by homoserine de- 
hydrogenase, encoded by hom, to homoserine. Homo- 
serine is phosphorylated by homoserine kinase, 
encoded by thr B, to homoserine phosphate in an ATP 
requiring reaction. The product, homoserine phos- 
phoric acid, is in turn converted to threonine by 
threonine synthase, encoded by thr C, a pyridoxal 
phosphate enzyme. Threonine, the end product of the 
sequence, is an inhibitory modulator of aspartate 
kinase. This reaction pathway is demonstrated in 
Figure 1. 

Genes encoding the three enzymes , homoserine 
dehydrogenase (horn) , homoserine kinase (thrB) , and 
threonine synthetase (thrC) have been isolated, 
identified, cloned, and their expression modified as 
follows. The cloning and determination of the 
nucleotide sequence of these genes provides a means 
for manipulating the expression and catalytic 
properties of the encoded enzymes. Means for 
altering the expression and the end product include 
in vitro mutagenesis of the C* cflutamicum hom gene 
and selection of derivatives resistant to L- 
threonine mediated feedback inhibition, sequence 
determination of feedback resistant derivatives and 
the use of rDNA techniques to combine separate 

genetic alterations, determination and modification V_ 
of the promoter structure and protein start sites 
for hom , thr B, and thrC, increased expression of 
hom-thrB via increased promoter efficiency and 
removal of L-methionine transcriptional repression. 
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and molecular joining of tlie C. glutamicum hom-thrB 

^™ I """"" Ik 

and thrC genes to form a hom-thrBC operon. 
Isolation of thr- and thr-/met- auxotrophs of C> 
glut ami cum . 

A genetic background in which to isolate the 
threonine biosynthetic genes was constructed by 
mutating C. gluteimicum and isolating auxotrophs 
defective in threonine biosynthesis. The C. 
glutamicum is maintained on liB media (10 g ITaGl, IG 
g Bactotryptone, 5 g Yeast extract, 1 1. H^O) or 
minimal medium for C. glutamicum (MCG) (lo g 
glucose, 7 g iim^)^SO^, 3g K^HPO^, Ig KH^PO^, 0.4 g 
MgSO^.7 H^O, 2 mg FeSO^^'^H^O, 2 mg MnSO^.H^O, 1 mg 
Biotin, 10 mg Thiamine, 2 ml trace elements, 1 
l.H^O) . 1.4% agar was added for plates. The trace, 
elements contained 44 mg Na^B^O^ , . TH^O, 20 mg 
(^^) Q^o^O^^.AK^Q, 5 mg ZnSO^,- 135 mg CuSO^.SH^O, 
3-6 mg MnCl^.H^O, 43 5 mg FeCl^ in 500 ml H^O. Where 
appropriate, 50 g/ml L-threonine, 50 g/ml L- 
methionine, 50 g/ml ampicillin, 15 g/ml kanamycin or 
10 g/ml rifampicin were added. 

C. glutamicum AS019, a rifampicin resistant 
variant of ATCC 13059, was grown at 30'C in LB to 
exponential phase (2 x 10° cfu/ml) , harvested by 
centrifugation and resuspended in an equal volxime of 
minimal media for C. glutamicum (MCG) . Cells were 
mutagenized by the addition of nitrosoguanidine 
(NTG) (40 micrograms/ml ) to 1 ml of cells and 

incubation without shaking at 30 "C for 3 0 minutes. * 
Mutagenized cells were harvested by centrifugation. 
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resuspended in 1 ml LB media and diluted 1:100 into 
10 ml aliguots of fresh LB, Following growth at 
sec with shaking to stationary phase, the cells 
were diluted and plated on LB agar, Auxotrophs were 
screened by replica plating onto MCG plates and 
identification by growth patterns on amino acid 
pools. Only one strain displaying a particular 
auxotrophy such as threonine requirement was saved 
from each of the 10 ml aliguots. 

Twenty four thr- and six thr"/met- C, gluta- 
mi cum auxotrophs were isolated. The thr/met- 
auxotrophs grow on MCG plates supplemented with 
homoserine. The thr- auxotrophs may have mutations 
in either of the threonine specific enzymes, homo- 
serine kinase or threonine synthase. 
Transformation and complementation of the C. 
qlutamicum auxotrophs , 

Two threonine requiring auxotrophs of C. 
qlutamicum , AS155 and AS178, were transformed using 
the following method. An overnight culture of AS019 
was inocculated at a ratio of 1:100 into LB broth 
containing 0.2% glucose and 2 . 0% ^glycine. The cells 
were incubated at 3 0"C for 15 hours with aeration. 
10 ml of cells were harvested by centrif ugation and 
washed in SMMC buffer (0.5/M Sorbitol, 2 0 mM MgSO^ , 
20 mM CaCl^, 50 mM Na Maleate, pH 7.0). Cells were 
resuspended in 2 ml SMMC buffer containing 2,5 mg 
lysozyme/ml. .The cell suspension was incubated at 
37 with shaking for 90 minutes. Cells were again 
harvested by centrif ugation at 6000 rpm for ten 
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minutes and resuspended in three ml SMMC buffer. 
0.3 ml aliquots of "protoplasted" cells were placed 
in polypropylene tubes. Plamid DNA in 0.5 M 
sorbitol was added. 0.7 ml of 4 0% PEG, molecular 
weight 3350, 50 mM Tris, 20 mM CaCl^ pH 7.4 was 
added and gently mixed. 2.0 ml of SB broth (0.5 M 
sorbitol, 1 X LB, 2 0 mM CaCl^/ 2 0 mM MgSO^) was 
added to the transformation mixture, which was then 
incubated at 30 'C without shaking for three hours ^ 
The C. qlutamicum protoplasts obtained by growth in 
glycine and lysozyme treatment can also be suspended 
in SMMC and frozen at -80 'C for use in subsequent 
transformations . 

The trans formants were plated out on selected 
plates- The two threonine requiring auxotrophs 
AS 155 and AS 17 8 were transformed with a C. 
glutamicum genomic library containing approximately 
2.5 genomic equivalents constructed in the C. 
qlutamicum/ B . subtilis chimeric plasmid pHY416, 
described by Yoshihama et al., J. Bacterid. 162, 
591-597 (1985) and Follettie and Sinskey in J. Bac- 
terid . 166 695-702 (1986) . Kanamycin resistant 
transf ormants were selected and screened for comple- 
mentation of the threonine auxotrophy by replica 
plating onto MCG/Km plates. Three AS155 trans- 
f ormants and a single AS 17 8 trans formant were 1 
capable of growth without threonine supplementation. 
Plasmids were isolated and characterized by restric- 
tion analysis. * 
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All four transf ormants harbored the same 
recombinant plasmid, designated pFS78, described in 
Figure 2, which contain a 6.8 kb chromosomal DNA 
insert. The recombinant plasmid was transformed 
into 10 of the independently isolated C- qlutamicum 
threonine auxotrophs and three auxotrophs requiring 
threonine and methionine or homoserine supplementa- 
tion- The Km^ transf ormants were screened for 
complejuentation on MCG/Km plates. The results 
demonstrated that pFS78 complements all three 
homoserine auxotrophs and four of the ten thr- 
auxotrophs, indicating that the plasmid carries the 
homoserine dehydrogenase gene, horn , as well as one 
of the threonine specific genes, thr B or thr C. 

Two of the thr- auxotrophs not complemented by 
pFS78, AS148 and AS213, were transformed with the 
genomic library and Km^ colonies screened for growth 
on MCG/Km plates. Both thr+ AS148 and thr+ AS213 
transf ormants were obtained, and their plasmids 
isolated and characterized by restriction analysis. 
All thr-*- transf ormants harbor the same 12.5 kb 
recombinant plasmid designated pFS80, also shown in 
Figure 2, containing a 3.1 kb chromosomal DNA 
insert. The chromosomal sequence cloned in pFSSO 
complements four other thr- auxotrophs not comple- 
mented by pFS78, However, pFSSO was unable to V 
complement the thr- or thr-/met- strains comple- 
mented by pFS78. 

Subcloninq and identification by enzyme assay, 
complementation of auxotrophs, and amino acid 
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sequence homolog y of the C. crlutamicum hom, thrB. 

and thrC locus . * 

Deletion analysis of pFS78 indicated that both 
the thr- and thr -/ met - complementing activities are 
located on a 3.6 kb Sai l generated chromosomal 
fragment. This fragment was purified by agarose gel 
electrophoresis and electroelution and ligated into 
the iinique Sai l restriction site. of the C. 
glutamicum/E. coli chimeric vector pWS12.4. described 
by Batt, Shamnabruch and Sinskey, Biotech. Letts. 
7:717 (1985). The recombinant vector, pFS3.6, 
complements both AS178 (thr-) and AS253 (thr-/inet-) . 
The plasmid also complements E. coli thr B auxotroph, 
E. coli 5076. 

The 2.7 Kb Sph l generated chromosomal fragment 
of PFS80 was purified by agarose gel electrophoresis 
and ligated into the unique Sph l restriction site of 
PUC18. The resulting recombinant plasmids, desig- 
nated pSPCl and pSPC4, also diagrammed in Figure 2, 
were able to complement E- coli 5077 (thrC) but not 
E. coli 5076 (thrB) . 

Southern hybridization analysis was used to 
determine the relationship of the hom -thrB and thr C 
loci. The results demonstrate that the hom-thrB and 
thrC locus in this species are physically separated 
by a minimum of 8.8 kb. ^ 

The homoserine dehydrogenase activity, in crude 
extracts of wild type AS019 was compared to that of 
the homoserine auxotroph AS253 with and without the ' 
complementing plasmid pFS3.6 in order to determine 
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the regulation and extent of overproduction of the 
cloned C. qlutamicum horn gene product. Methods used 
for preparing a C. qlutamicum crude extract 
preparation and assays for homoserine dehydrogenase, 
homoserine kinase and aspartokinase are as follows: 

Homoserine dehydrogenase is measured by the de- 
crease in absorbance at 340 nm due to the oxidation 
of NADPH (extinction coefficient = 6220) The 
reaction mixture contains: 3 mM DL aspartate-beta- 
semialdehyde (ASA), 0-4 mM NADPH, 0.1 M PO^ , pH 7.0, 
0.5 M KCl, and enzyme preparation, in a total volume 
of 0.7 ml. A blank reaction mixture without DL-ASA 
serves as a control. DL-ASA is synthesized by the 
ozonolysis of DL-allyl glycine according to the 
procedure of Black and Wright, J. Biol. Chem. 213, 
39 (1955) . 

Homoserine kinase activity was determined by a 
coupled enzyme assay which measured the reaction 
product ADP. The reaction mixture contained 3.3 mM 
ATP, 0.4 5 MM NADH, 4 . 5 mM phophenol pyruvate, 1.0 mM 
L-homoserine, 10 mM MgCl^ 12.5 units pyruvate kinase 
(Sigma, St. Louis, MO) , 25 units lactate de- 
hydrogenase (Sigma), 0.25 M KCl, 100 mM HEPES buffer 
(pH 7.8) and enzyme preparation in a total volume of 
1.0 ml. The reaction was monitored by the decrease 
in absorbance at 34 0 nm due to the oxidation of 
NADH. The absorbance decrease in the absence of 
added substrate, L-homoserine, was determined and 
subtracted from values obtained with the complete 
assay mixture. 



BNSDOCID: <WO. . e809919A2.l. > 



wo 88/09819 



« 

PCT/US88/02029 



14 

Aspartate kinase activity, inhibited by 
threonine, is determined by measuring the 
aspartohydroxamate produced according to the pro- 
cedure of Black and Wright, J, Biol, Chem, 213, 27 
(1955) - Protein in the crude extract is precipi- 
tated by adding 5 volumes of saturated ammonium 
sulfate and resuspended in 0.3 volxime of buffer 
containing O.l M Tris, pH 7.4, 0.2 M KCl, The assay 
mixtxxre contains: O.l M Tris, pH 7 ,4, 10 mM ATP, 10 
mM MgSO^ 0,6 M hydroxylamine (pH 7,4), 0.6 M (NH^)^/ 
50 mM li-aspartate and enzyme preparation in a total 
volume of 1 ml. After 1 hr incubation at 37 -C, the 
reaction was stopped by the addition of 1.5 ml of 
solution containing 10% FeCl^.e H^O, 3.3% trichloro- 
acetic acid and 0.7 N HCl. After centrifugation^ 
aspartohydroxamate concentration is measured by 
absorption at 540 nm (extinction coefficient = 600) . 
A blank reaction mixture without L-aspartic acid 
serves as a control. 

Protein concentration of the crude extracts is 
determined using the Bio-Rad protein assay with 
bovine serum albumin standards (BioRad Laboratories, 
Richmond, CA) . 

qlutamicum thr"/inet" strain AS253 
harboring the parental vector pWS124 had less than 
2.5% of the homoserine dehydrogenase activity 
present in the wild type AS019. Introduction of the 
cloned C. glutamicum horn gene present on pFS3.6A 
into C. glutamicum AS253 leads to a twenty-fold 
increase in the specific activity of homoserine 
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dehydrogenase over that observed in wild type C. 
qlutamiciun AS019. The orientation of the cloned hoin 
gene with respect to the vector affected its expres- 
sion. Crude extracts of AS253 harboring pFS3.6B 
demonstrated an 11-fold increase in homoserine 
dehydrogenase activity relative to wild type. 

The level of aspartokinase in glutamicxim 
AS 019 harboring either the parental vector pWS124 or 
the" recombinant vector pF53.6 was unchanged over 
that observed in the controls* Further, the 
aspartokinase specific activity was not repressed by 
growth in MCG supplemented with 2.7 mM L-methionine . 
The differential transcriptional control of homo- 
serine dehydrogenase, in combination with the lack 
of increased aspartokinase activity in cells harbor- 
ing pFS3.6 (hom-thrB) , demonstrates that the two 
activities are not catalyzed by a bifunctional 
protein as in coli . The expression of the 
encoded homoserine dehydrogenase is repressed 3 . 2 
fold by the addition of 2.1 mM L-methionine* 
Expression of the qlutamicum thr A gene is also 
repressed by L-methionine, demonstrating that the 
expression of the pFS3.6 encoded horn gene is medi- _ 
ated by its native promoter/operator. Expression of 
the cloned C. qlutamicum thr B gene was similarly 
repressed 2,6-fold by 2.7 mM L-methionine. 

The activity of the homoserine dehydrogenase, 
both chromosomal and plasmid encoded, is inhibited 
by the addition of L-threonine to the assay mixture. 
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Addition of 1 mM D-threonine or L-methionine does 
not affect the homoserine dehydrogenase activity. 

The complete nucleotide sequence of 3704 bp is 
shown in Figure 3 for hom and thrB. Two long open 
reading frames (ORF's) extend from position 907 to 
2329 and from 2312 to 3269. The protein sequences 
of homoserine dehydrogenase and homoserine kinase 
are predicted on the basis of the sequence extending 
from the first potential translation initiation 
codon, either ATG or GTG (position 994) to the TAA 
stop at position 233 0 for ORFl and from the ATG at 
2342 to the TAG stop 3269. The predicted proteins 
have molecular weights of 46,436 and 32,618 daltons 
for ORFl and 0RF2, respectively. A translation 
terminator is present at position 3279 to 3 311, 
seven nucleotides downstream of the TAG stop codon. 
This is shown in further detail in Figure 5. The 
sequence forms a strong step-loop structure having a 
stem length of 15 bp and a seven base loop similar 
to the rho- independent terminators from coli. 
The 5' sequence to ORFl has a region strongly rich 
in A:T containing the hom-thrB promoter and site of 
action of the methionine mediated repression. 

The DNA sequence of the chromosomal DNA insert 
in pFSSO, encoding threonine synthase (thrC) , was 
also determined by dideoxy sequencing techniques and A. 
is shown in Figure 4. A restriction map was pre- 
dicted and checked against restriction analysis 
results to corroborate the acctoracy of the sequence " 
data. Computer aided analysis (UWGCG Programs, UW 
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Biotechnology Center, University of Wisconsin) was 
used to predict the thrC gene within the sequence 
data. These results were compared with in vivo 
genetic deletion analysis. An open reading frame 
extends 5' to GTG at 39 6) marking the amino terminal 
region of thrC. 

The threonine synthase activity maps within the 
1,57 kb Bcll-Stul restriction fragment. The 
computer -predicted structural gene secjuencsy GTG 
(396) to TAA (1881), lies completely within this 
fragment. The Stu I restriction is 17 6 bp 3' of the 
preducted translation stop codon. 

Heterospecif ic genetic complementation of the 
E. coli thrC 1001 auxotroph shows that the C. 
qlutamicum thr C gene is expressed in coll. By 
comparison, using computer searches for regions 
similar to coli ribosome binding sites and 
translation terminator sequences, a ribosome binding 
site adjacent to GTG (396) and a significant termina- 
tor-like sequence 35 bp 3 ' of the TAA at 1881 were 
identified. Homology was detected between C. 
qlutamicum and coli thr C regions at both DNA and 
predicted protein sequence levels. Limited 
conservation of DNA sequence was observed between 
the coli thrC gene and the region 4 00 to 1400 bp 
of the Cj^ qlutamicum thrC sequence. There is 
consistent conservation in the central region 
(residues 100 to 350 of C^^ qlutamicum thr C) and the 
carboxy terminal residues 4 30 to 480. 
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Identification of the hom-torB t.ranscript:ion start 
site by SI nuclease mappinq- and deletion analysis . 

The transcriptional start site for the C, 
qlutamicxm hom-thrB genes was identified using SI 
nuclease mapping, as described by Berk and Sharp, 
Cell 12, 721 (1977) . The procedure requires the 
isolation and denaturation of a DNA fragment which 
overlaps the promoter and has been label at the 

5* end of the antisense strand. Hybridization of 
this fragment to its cognate mRNA and subsequent 
digestion with the single strand specific exo- 
nuclease SI results in the degradation of the 3* end 
of the labled DNA fragment up to the point at which 
it is protected by the RNA. The size of the result- 
ing DNA fragment is determined by comigration with 
DNA fragments resulting from the sequencing reac- 
tions of Maxam and Gilbert, Methods in Enzymol. 65, 
499-559 (1982) . This enables the identification of 
the transcriptional start site- The results can 
then be confirmed by deletion analysis of the 
promoter using restriction enzymes and exonuclease 
Bal 31 to construct series of deletions which are 
then reinserted into the organism and assayed for 
activity. 

The Sma l- Hin dlll restriction fragment that 
encompasses the hom - thr B promoter/ operator and the 
first seven amino acid residues of the hom gene 
product was used in the SI nuclease mapping studies. 
Plasmid pRAl (pUClS containing the 3,6 kb Sail C. 
qlutamicum genomic fragment encoding hom - thr B) was 
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cut with Hin dlll to generate a 1-014 kb restriction 
fragment, dephosphorylated with CIP (calf intestine 
phosphatase, Boehringer-Mannhein Biochemicals, 
Indianapolis, IN) , labelled by treatment with 
polynucleotide kinase and gamma P-ATP (specific 
activity greater than 5000 Ci/mmol, Amersham Corp. 
Arlington Heights, IL) , subsequently cleaved with 
Sma l (New England Biolabs, Beverly, MA) to produce a 
242 bp DNA fragment that was then purified by 
preparative polyacrylamide gel electropheresis . All 
manipulations were carried out in accordance with 
procedures described in Molecular Cloning' by T. 
Maniatis et al. (Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY, 19 82) and the enzyme suppliers 
recommendations • 

Understanding the expression of a gene requires 
the isolation and structural characterization of the 
mRNA product, the size and number of transcripts, 
the regulatory control and the site of transcription 
initiation. Criteria evaluated for RNA isolated 
from Corynebacteria include RNA quality (Abs^^Q^ 
Abs^Q,., ratio of about 1.95 to 2.05), purity 

2 o O 

(degradation and contamination determined by agarose 
gel electropheresis) , and yield in mg RNA/liter 
cells (Abs^gQ = 4 micrograms RNA/ml) . 

RNA was extracted from C. qlutamicum AS019 
using the guanidinium isothiocyanate/French press 
isolation method. In this method, all. LB culture 
of AS019 is grown at 30 'C to late exponential phase 
and harvested by 10 minutes centrif ugation at 5000 
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RPM in a Sorvall GSA-250 rotor. The cells are 
washed at: 4*'C in 0.1 M NaCl, 10 mM Tris.Cl, pH 8.0, 
1 mM EDTA, and harvested again. The pellets are 
combined in 50 ml 4 M guanidinixim isothiocyanate, 
2-mercaptoethanol CGuT:2ME) and immediately lysed by 
compression through a French press at approximately 
1500 psi. Cell debris is sedimented by centrifuga- 
tion in a Sorvall SS-3 4 rotor for 10 minutes at 
10^000 rpm- Six ml aliguots of the supernatant are 
applied to 4 mil of 5.7 M CsCl, 10 mM ETTA, 25 mM 
sodium acetate and centrifuged at 34,000 rpm in a 
BecJonan Ti50 fixed angle rotor for 24 hours. 

The density gradient separates the sheared DNA 
molecules from the RNA, which forms a pellet at the 
tube base. This RNA pellet i$ resuspended in 5 ml 
10 mM Tris, pH 7.5, 1 mM EDTA 5.0% Sarkosyl (TESK) 
containing 5.0% phenol. The solution is made 0.1 M 
with 5 M NaCl, and extracted with 10 ml 50% phenol, 
49% chloroform, 1% isoamylalcohol (PCIA) . The 
phases are separated by centrifugation in a Sorvall 
SS-34 at 3,000 rpm for 5 minutes and the phenolic 
phase back extracted with TESK containing 0.1 M 
NaCl. The combined aqueous phases are made 0.2 M 
with sodium acetate, pH 5.5, and the RNA precipi- 
tated overnight at -20 in 2.5 volumes of ethanol. 
After centrifugation at 10,000 rpm for 20 minutes at 
4*C, the RNA pellet is washed in ethanol, dried 
under vacuum, and resuspended in RNase free water at 
a concentration of 0.5 mg/ml. 
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Total cellular RNA, isolated from AS019 grown 
in minimal media with and without L-methionine (4 00 
microgram/ml ) supplementation, was separated by 
agarose gel electrophoresis, transferred to nitro- 
cellulose paper and probed either with pMF-L2 or 
pUC-B5. Plasmid pMF-I»2 contains a 1.8 kb Nael 
fragment which spans both the horn and thr B genes but 
contains no flanking sequences. RNA is glyoxylated 
to prevent spurious electrophoretic patterns caused 
by potential secondary structure* For each lane, 2 0 
micrograms of C, qlutamicum RNA is suspended in 8 
microliters of glyoxal reaction mixture (1 M 
glyoxal, 50% DMSO, 10 mM potassium phosphate, pH 
7.0) and incubated 1 hour at Glyoxylated RNA 

samples are prepared for loading by the addition of 
17 microliters formamide, 6.2 microliters formalde- 
hyde, 3 microliters lOx running buffer 0.2 M 
morpholinopropanesulf onic acid (MPOS) , 50 mM sodium 
acetate, 10 mM EDTA) and 5 microliters loading dye 
(50% glycerol, 1 mM EDTA, 0.4% bromophenol blue, 
0-4% xylene cyanol) . Samples are loaded onto an 
agarose/f ormaldehyde gel (2.2% agarose, 1 x running 
buffer, 18% formaldehyde, pH adjusted to 7.0 with 
NaOH) and electrophoresed at 30 mA. Hindlll re- 
stricted lambda DNA is labeled with "^^P, denatured 
and glyoxylated similar to RNA samples and utilized 
as a molecular size standard. Following electro- 
phoresis, nucleic acid is transferred to a nitro- 
cellulose filter using the technique of Southern, J . 
Mol. Biol. 98, 503-517 (1975) except that no prior 
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treatment of the gel was necessary. Following 

transfer for 15 hours, the filters are baked in 

vacuo at 80 "C for 2 hours • 

The filters are prehybridized in sealed plastic 

bags for 16 hours at 42* C in a minimum volxime, 

approximately 10 mis of hybridization buffer (50% 

deionized formamide, 5 x SSC, 50 mM sodium acetate, 

pH 6.5, 25 micrograms sonicated denatured salmon 

s perm DNA, 0.02% bov ine serum albumin, 0. a2_% Ficoll, 

0.02% polyvinyl pyrrolidone) • (SCC = 0.15 M NaCl. 

0.015 M sodium citrate, pH 7.0). The DNA probe is 
3 2 

labeled with P by nick translation, according to 
Rigby et al., J. Mol. Biol. 113, 237-251 (1977), 
heat denatured, added to the hybridization buffer, 
and inctibated with the filter for 20 hour at 42 'C. 
Filters are subsequently washed five times in 2 x " 
SSC/ 0.1% SDS at room temperature and then three 
times in 0.2 SSC at 50 'C. After drying, the filters 
are exposed to X-ray film and specific bands of 
hybridization determined by autoradiography. 

Hybridization of pMF-L2 to total C. glut ami cum 
RNA leads to the appearance of a single 2.4 kb 
transcript. This observation is in agreement with 
the predicted size of the hom-thrB transcript (2408 
base pairs) , based on SI nuclease mapping, and the 
computer predicted termination site of the thr B v 
gene. The size of the observed tremscript and the 
lack of a detectable second transcript hybridizing 
to the hom - thr B probe leads the conclusion that C^ * 
glut ami cum expresses horn and thr B from a single 
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transcriptional unit, representing the first defined 
operon in this organism. Results obtained by SI 
nuclease mapping of the thrA-thrB junction support 
this conclusion. 

Hybridization of hom-thrB specific RNA to the 
242 bp Sinal-HindDIII end-labeled probe was achieved 
by lyophilizing 3 0 micrograms C . glutamicum RNA with 
10 ng probe DNA and resuspending in 10 microliters 
hybridization buffer (40 mM PIPES pH 6.4, 0.4 M 
NaCl, 1 mM EDTA, 80% deionized formamide) . The DNA 
was denatured by heating at 90 'C for 10 minutes. 
Hybridization was performed overnight at 49*'C. 

The hybrid DNA-RNA molecules were digested with 
2,000 units SI nuclease (Bethesda Research Labora- 
tories, Inc. , Gaithersburg,, MD) in 235 microliters 
assay buffer (250 mM NaCl, 30 mM sodium acetate, 10 
mM zinc sulphate, 20*0 microgram/ml calf thymus DNA) • 
The digest was incubated at 37**C for one hour and 
terminated by extraction with 2 50 microliters PCIA 
(phenal/chlorof orm/isoamyl alcohol, 50:48:2). 
Nucleic acids were precipitated from the aqueous 
phase with 0.2 M sodium acetate, 2 micrograms yeast 
tRNA and 50 microliters ethanol at -2 0 'C. Following 
centrifugation and drying, each sample was dissolved 
in 3 microliters formamide loading buffer (100 ml 
formamide, 0.72 g Na^ EDTA, 0.03 g bromophenol blue) 
and applied to a 6% polyacrylamide/7 M urea sequenc- 
ing gel. The 242 bp Smal-Hindlll restriction 
fragment was sequenced using the procedures for the 
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G, C+T and C reactions as described in Maxam and 
Gilbert, Methods in Enzymol, 65, 499-559 (1982) . 

TJae Smal-Hindlll fragment labeled at the 
Hin dlll 5 •terminus (antisense strand) acts as a 
specific probe to RNA complementary to th.is region. 
To detect the start of the hom-thrB mENA transcript, 
total RNA is hybridized to the labeled probe and the 
unprotected single stranded nucleic acid digested 
with the single strand soecific SI nuclease* The 
length of the protected region of the DNA probe is 
resolved on a denaturing 6% polyacrylamide gel. The 
hom - thrB transcript initiates at coordinate 9 06, at 
the first of a GG doxablet. This nucleotide is 88 bp 
5» of the first available ATG codon in the horn open 
reading frame. This defines the promoter .region 
responsible for hom-thr B expression and is 
designated thrPl for threonine promoter 1. The 
sequence is shown in Figure 5a • 

No detectable degradation of the DNA probe from 
the 0.46 kb Fokl- Pvull restriction fragments span- 
ning the horn thr B Pvull junction, indicating that 
the majority of the thr B expression was mediated by 
thrPl. 

Identification and deletion of the operator 
mediating- L-methionine repression of hom-thrB 
expression. 

In addition to promoter identification, 
restriction and/or exonuclease Bal 31 deletions have 
been utilized in identification and deletion of the 
operator (thrO) , which mediates the L-methionine 



BNSDOCID: <WO 88098 19A2_L> 



wo 88/09819 



PCT/US88/02029 



25 

repression of hom-thrB expression and in the con- 
struction of a feedback inhibition deficient variant 
of the hom gene* These studies were facilitated by 
construction of a special vector designated pWSTl . 
When investigating promoter structure and function 
on a plasiaid, it is desirable to eliminate read 
through transcription from upstream promoters 
located within the cloning vector. Plasmid pWSTl 
contains the E. coli tr^A terminator followed by a 
polylinker to facilitate the cloning of the gene in 
the various deletion generated variants. The effect 
of the deletions can be assayed in the absence of 
influence by upstream promoters. This vector is 
applicable not only to. the analysis of the hom - thr B 
genes, but alsto the characterizations of other 
promoter/operator systems in C. cylutamicum . pWSTl 
is constructed using the trpA terminator obtained 
from Pharmacia Fine Chemicals, Piscataway, N J . Sacl 
linkers are attached and the trp A terminator in- 
serted into the polylinker region of M13mpl9 • The 
constructs are sequenced using the method of Sanger 
et al. Proc. Natl > Acad. Sci. USA 74,5463-5467 
(1977), to screen for insertion of the terminator in 
the proper orientation. The terminator/polylinker 
is subsequently ligated into the Smal-Sall re- 
stricted pTF3 3, a derivative of the C. glutamicum/E . 
coli shuttle vector pWS124, described by Batt et 
al. , Biotechnol . Letts . 7:717 (1985) . DNA linkers 
and enzymes are obtained from New England Biolabs or 
Boehringer-Mannheim, as noted earlier. 
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The liom-thrB genes ligated into Sma l/ Sal l re- 
stricted pWSTl on a 29 LI bp Smal-Sall restriction 
fragment and a 2,815 bp Dral-Sall restriction frag- 
ment, designated pWFS2,9 and pWFS2*8, respectively. 
Further deletion of the hom-thrB upstream region is 
accomplished as diagramed in Figure 6, The recombi- 
nant vector pWFS2.9 was linearized by Smal digestion 
and deletions constructed by digestion of 6 micro- 
grams of DNA with- the expnuclease Bal 31 (0 = 2 units) / 
micrograms DNA. Aliquots of the reaction mixture 
were removed at 30 second intervals between 4 and 15 
minutes^ and the reaction stopped by dilution into 
one volume of 50 mM EDTA. The DNA was digested with 
Sail and the resulting hom-thrB containing fragments 
purified by agarose gel electrophoresis. These 
fragments were ligated into Sma l -Sal l digested pWSTl 
and the resulting recombinant mixture used to 
transform C. qlutamicum AS253 (horn) . ' 

The extent of the Bai:^l generated deletions, 
diagrammed in Fig, 7,. in complementing a non- 
complementing derivative plasmid is deteriained by 
nucleotide sequence analysis and measurement of the 
levels of homoserine dehydrogenase activity in crude 
extracts. The ability of the deletion plasmids to 
complement the hom - thr B auxo trophy of strain AS 25 3 
was checked by streaking the corresponding AS253 V 
transf ormcmts onto MCG/kanamycin agar plates. 

The results of the deletion construction and 
their effect on horn gene expression show that ^ 
deletion of sequences upstream of horn , up to the 
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Smal (pWFS2.9) or Dral (pWFS2.8) restriction site 
(218 and 124 bp prior to the predicted translational 
start site, respectively) does not drastically 
influence the expression of the horn gene. 

As predicted from the SI nuclease studies, the 
Dral-Sall hom - thr B fragment contains both the -10 
and -35 riegions which are critical for promoter 
activity in E. coli / as reported by Hawley and 
Mcei-ure, Nucleic Acid ^e s . 11 ; 2^ 3 2 (1S83 ) . Furthar 
deletion by Bal 31 markedly reduces the expression 
of the hom gene product homoserine dehydrogenase, 
supporting the data obtained with. SI nuclease 
mapping of thrPl. Two of the deletion derivatives, 
pWFSdelta2304 and pWFSdelta2207 are able to comple- 
ment C. -qlutamicum AS253 despite the reduction in 
hom expression to 2 and 10%, respectively, of that 
observed in strains containing the parental plasmid 
pFS3.6. The relative specific activity of homo- 
serine dehydrogenase observed in C. glutamicum AS253 
(hom) harboring these, two deletion derivatives is 
3.1 and 0.7 with respect to that observed in wild 
type strains. c. glutamicum auxotrophs requiring 
threonine/methionine express approximately 2% of the 
wild type level of homoserine dehydrogenase activ- 
ity. The deletion of the hom promoter carried in 
pWFSdelta2431 results in a 96-fold decrease in the V, 
expression of the cloned thr B gene thus demonstrat- 
ing a common promoter. 

The Bal 31 generated deletions enable the 
mapping of the boundary between those deletion 



BNSDCXID: <WO.._ 88096 19A2J_> 



wo 88/09819 



PCr/US88/02029 



28 

derivatives which, complement the threonine/ 
methionine axixotrophy and those that fail to comple- 
ment, between 63 base pairs (pWFSdelta23 04) and 56 
base pairs (pWFSdelta2431) upstream of the predicted 
horn translation start sites. The observation that 
deletions extending to locations beyond the putative 
start point of transcription (88 bp upstream of the 
hom start codon) is determined by SI nuclease 
mapping, does not necessarily result in the complete 
loss of homoserine dehydrogenase activity. This 
loss may also be due to weak promoter activity 
adjacent to the main transcription start site. 

The mechanism of transcriptional regulation of 
the hom-thrB operon was determined to involve 
control by the single stem/loop attenuator shown in 
Figure 5. Specific- deletion of the stem/loop 
structure removes the methionine repression of 
hom-thrB expression. In this structure, the se- 
quence ATGTAG, encoding Met-Stop, forms the loop. 
The sequence TTTTGGACA, similar to the TTGGAGA that 
precedes the predicted translational start site of 
the hom gene, precedes the ATG and thus represents a 
potential ribosome binding site. A possible model 
is one in which a boxind ribosome can be momentarily 
stalled due to a low concentration of charged 
methionine tRNA, thus preventing stem/loop formation 
and allowing transcription to continue. At higher 
concentrations of methionine, the ribosome would 
move to the TAG Stop signal and disengage, allowing . 
stem/ loop formation. 
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Lac Z protein fusions can be used to evaluate 
promoter and operator functions directly in C. 
qlutaiaicum . This method is demonstrated using the 
hom-thrB promoter/operator, isolated on a 33 0 bP 
Smal-Hindlll fragment purified and ligated into 
similarly digested pSKSlO? that contains a 
promoterless lactose operon. The construction 
creates a hom-lacZ protein fusion containing the 
N-terminal eight horn amino acid residues preceding 
the lac2 gene product beta-galactosidase. The 
expression of the fusion protein products required 
the insertion of a ribosome binding site, initiating 
codon (ATG/GTG) , under the control of the hom-thrB 
promoter/ operator . The recombinant vector was 
introduced into E- coli JM8 3 where beta- 
galactosidase activity as' observed in crude ex- 
tracts, supplementation of the growth medium with 
L-raethionine represses the expression of lac Z 
two-fold. 

Deletion of a portion of the dyad symmetry 
element required for operator function demonstrates 
the role of the dyad symmetry element in the regula- 
tion of the hom - thr B gene. A 1.4 8 kb Kpn l 
restriction fragment containing the horn gene of the 
Bal 31 deletion derivative, pWFSdelta22 07 was 
purified and used to replace its counterpart in the 
parental vector pFS3.6A. The resulting recombinant 
plasmid, designated pWFS2207deltal , contains a 
specific 10 base pair deletion removing the left 
half of the dyad symmetry element. Identical levels 
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of homoscrine dehydrogenase activity were meastired 
in strains grown in MCG medium with and without 
methionine supplementation. This demonstrates that 
the dyad symmetry element is the site of 
L-methionine repression (thrO) . 

Identification of the thrC promoter by deletion 
analysis^ complementation of auxotrophs and 
overproduction of the enzyme > 

The promoter sequence for expression of thrC 
was also determined by deletion analysis, 
aiixotrophic complementation, and overproduction of 
enzyme. 

The overproduction of the product of the thr C 
gene, threonine synthetase, can be measured from 
crude extracts of C. g-lutamicvim strains AS213 ajid 
wild type AS 019 containing the parental vector 
pHY416 or the thr C containing pFSSO. The two 
strains containing the plasmids are grown in MCG 
medium, the cells harvested, lysed, cell debris 
removed by centrifugation, the protein purified by 
40 to 60% ammonium sulphate fractionation, DEAE- 
Sephadex column chromatography with a 0.2- M to 0.2M 
to 0.6 M KCl gradient, and anion exchange 
chromatography in a FPLC column dluted with a 0.1 M 
to 0.7 M KCl gradient. The results demonstrate that 
the protein is produced at a level 200% of that 
observed in the wild type. The method produces 
threonine synthetase specific activity demonstrating 
a purification of over 350-fold. The protein has a 
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molecular weight of 56,000 by SDS polyacrylamide gel 
electropheresis • 

Analysis of activity complementation of the 
thrC auxotroph indicates that the promoter for the 
thrC gene sequence precedes the predicted transla- 
tion start site for the thrC gene product by 
approximately 8 0 base pairs. The sequence, TTGAAA — 
(16 bp) — TAGGGT, is Closely related to the E. coli 
consensus sequence as well as the promoter sequence 
determined for C. qlutamicum thr Pl, AAAGCA — 18bp — 
TATAGT, Confirmation of the identification of the 
sequences the thr C gene promoter sequence is done by 
SI nuclease analysis. 

Modification of the enzyme structure and expression 
of hom^ thrB and thrC . 

Once the hom, thrB and thr C genes are identi- 
fied, including the identification if the initiation 
sites of both mRNA and protein synthesis for the 
genes, it is possible to increase the quantity of 
gene expression by increasing the gene dosage by 
localization of specific genes on a multicopy 
plasmid, by site-directed mutagenesis or replacement 
of the promoter, by increasing translational effi- 
ciency through alteration of the ribosome binding 
site, or by increasing stability of the protein by 
site-directed mutagenesis. The quality of the 
particular gene can be increased using in vitro and 
site— directed mutagenesis to alter substrate 
utilization as well as the kinetic and regulatory 
properties of the enzyme. Physical properties such 
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as heat stability can also be modified. The 
construction of a vector with the three genes 
transcribed as a single unit under the control of a 
high efficiency promoter results in more efficient 
threonine synthesis. One can also remove the 
L-threonine feedback inhibition of homoserine 
dehydrogenase or the threonine and lysine inhibition 
of aspartokinase to produce the overproduction of 
threonine. The feedback inhibition of homoserine 
dehydrogenase can be removed by in vitro mutagenesis 
using either hydroxylamine and/or sodium bisulfite, 
methods well known to those skilled in the art, or 
by recombinant techniques. The mutagenized plasmids 
are reintroduced into glutamicum and screened for 
AHV resistance or by enzyme assays. Increased 
promoter efficiency can also be accomplished by site 
directed mutagenesis of the existing promoter or by 
replacement with a high efficiency promoter. 

The thrC gene can be placed under the trans- 
criptional control of. high efficiency promoters such 
as the E. coli promoter tac to produce elevated 
levels of the gene product. The expression vector 
PKK233-2, obtained from Pharmacia Fine Chemicals, is 
restricted with Ncol- Hind lll. The plasmid PFS80 is 
cleaved with Bel l, blunt ended with Klenow 

polymerase, and Ncol linkers ligated onto the flush ( 

ends. The ligation product is double digested with 

Ncol and Hindlll, a 2.8 kb fragment purified and 

ligated into similarly digested pKK233-2. The *^ 

resulting recombinant vector designated pKC14 is 
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transformed into E, coli JM105. The requirement for 
IPTG (isopropyl-beta-D-thio-galactopyranoside) 
induction demonstrates that the thrC gene is under 
the transcriptional control of the tac promoter • 
The threonine synthetase activity measured in the 
absence of IPTG was 1*3 nmole/min/mg protein. The 
addition of 2 mM IPTG induces 24 times the threonine 
synthetase (30.9 nmole/min/mg-protein) . 

The homoserihe dehydrogenase anU homoserine 
kinase polypeptides encoded by the open reading 
frames corresponding to the hom and thr B gene 
products can be expressed and purified for analysis. 
The enzymes are purified from 10 liters of MCG broth 
from a CHEMAP fermentator innoculated with a 3 00 ml 
overnight culture of C. qlutamicxim AS019/pFS3.6 
grown for 24 hours at 30 "C with 470 rpm agitation. 
Cells were harvested by ultrafiltration and centri- 
fugation, the cell pellet resuspended in lysis 
buffer (100 mM KPO^ , pH 7.0, 0.5 M KCl) and the 
cells lysed by repeated passage through a French 
pressure cell. Debris is removed by centrif ugation, 
the supernatant precipitated with ammonium sulphate, 
and the enzyme activities separated on a DEAE- 
Sephadex A-50 column eluted with a linear 0.3 M to 
0.8 M KCl gradient. Fractions containing the 
appropriate enzyme activity are pooled and the 
proteins analyzed by SDS polyacrylamide electro- 
phoresis. The proteins are then further purified on 
hydroxylapatite HPLC prior to final separation by 
preprative SDS-PAGE. The purified homoserine 
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dehydrogenase remained active through the procedure 
and has a final molecular weight of 47,000 daltons. 
The activity of the homos erine kinase is lost, but 
the protein had a molecular weight of 3 2,000, 

The observed molecular weights are in close 
agreement with the molecular weights predicted from 
the nucleotide sequences. The NH^ terminus of the 
homoserine dehydrogenase is blocked, however, the 
amino acid composition is in good agreement with the 
amino acid compos^ition predicted from the gene 
sequence. The expression of activity by ligation of 
the Smal -Hindi 11 fragment containing the C. 
<?lutamicum thrPl and predicted NH^ -terminal seven 
amino acid residues of the horn gene product indicate 
that the N-terminal sequence is correct. The first 
ten residues of the thr B gene product, homoserine 
kinase, is in complete agreement with the predicted 
amino acid sequence,. This identifies the 
translation initiation site for the thr B gene as the 
ATG at nucleotide 2342, confirming the predicted 
primary structure of the C« glutamicum homoserine 
kinase. The protein appears to undergo post- 
translational removal of the N-f ormyl-MET, a rela- 
tively common feature of procaryotic proteins. 
construction and application of a C. crlutamicum 
promoter probe . 

A promoter probe for use in identifying, 
isolating, and quantifying the efficiency of 
promoters is based on pWSTl and designated pAL-1. 
The chloramphenicol acetyl transferase (CAT) gene is 
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used as the promoter probe gene. The cat gene is 
expressed in E. coli. ^ B.subtilis , and C, 
qlutamicum . Since cloning of strong promoters can 
induce plasmid instability by transcriptional 
interference at the replication origin, it may be 
necessary to clone the fd enteric bacteriophage 
major gene terminator at the 3* terminus of the test 
gene. Potential promoter sequences can then be 
screened for the acquisition of chloramphenicol 
resistance. Preliminary estimation of promoter 
efficiency can be accomplished by determining the 
extent of antibiotic resistance. 

A number of promoter sources can be screened 
for their efficiency in C. qlutamicum . High effi- 
ciency promoters from other procaryotic systems are 
known, for example, E. coli trp , the hybrid Ptac, 
lambda pH, and B, subtilis Preg. Random C. qluta- 
micum chromosomal DNA and/or corynephage DNA frag- 
ments can also be inserted into the polylinker site, 
upstream in the test gene and cat activity deter- 
mined to assess promoter efficiency. DNA sequencing 
and determination of the transcriptional start sites 
are used to characterize the promoter structure. 
For example, the 266 bp Dral-Haelll fragment which 
spans the predicted thr C promoter region was 
purified and ligated into the Sma l restriction site 
of the pALi-1 polylinker. The resulting recombinant 
mixture was introduced into E. coli and 
chloramphenicol resistant transf ormance obtained. 
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The primary site of metabolic regulation of 

threonine biosynthesis in C > g'lutamiciim is the 

threonine inhibition of homoserine dehydrogenase. 
To remove the metabolic block, in vitro autogenesis 
has been used to alter the hom gene product to 
produce feedback inhibition deficient variants. The 
recombinant plasmid pWFSdelta2207 was used as a 
source of the hom gene. This plasmid expresses a 
lower level of homoserine dehydrogenase (10%) than 
PFS3.6A, eliminating potential artifacts due to the 
overproduction of the hom gene product. Plasmid DNA 
was digested with Kpn l, separated by agarose gel 
electrophoresis and the fragments isolated by 
electroelution into dialysis bags. The 1.43 kb f^pn l 
fragment containing the hom gene was purified and 
treated with hydroxy laimine. This is a potent 
mutagen primarily causing AT to GC and GC to AT 
transitions. The mutated hom gene was isolated and 
3 micrograms of target DNA resuspended in 280 
microliters of 1 M hydroxylamine, 0.3 M KPO^, pH 
6.0, aliquots removed at between 10 and 3 00 minutes 
and the reaction stopped by ethanol precipitation. • 
The mutagenized fragments were religated to the 
large, 12.7 kb, Kpn l restriction fragment and 
transformed into E. coli JM83. 

Plasmid DNA from ampicillin resistant trans- V 
formamts was purified on CsCl gradients and trans- 
formed into the restriction deficient C. glutamicum 
AS019-E12. These transf ormants were then screened 
for resistance to alpha-aminohydroxyvaleric acid 
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(AHV) in order to select for a deregulated hom gene 
product. Homoserine dehydrogenase activity assays 
were used to confirm and demonstrate the removal of 
the L-threonine mediated feedback inhibition. 
Different mutations could be combined by recombinant 
DNA techniques to determine the extent to which they 
are cooperative. 

The present invention, nucleotide sequences 
encoding threonine biosynthetic enzymes, and methods 
and sequences for the expression and regulation of 
expression of these enzyme encoding sequences are 
disclosed. Modifications and variations of this 
invention will be obvious to those skilled in the 
art of genetic engineering from the foregoing 
detailed description. It is intended that these 
modifications ,and variations will fall within the 
scopes of the appended claims. 
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WE CLAIM: 

1. A method for the production of threonine 
comprising: 

providing nucleotide secjuences for the 
genes encoding the enzymes in the threonine 
biosynthetic pathway; smd 

inserting the enzyme encoding sequences 
with selected nucleotide sequences mediating 
the expression and regulation of the enzyme 
encoding sequences int_o an expression vector • 

2. The method of claim 1 further comprising 
inserting said expression vector into an 
expression host. 



3. The method of claim 2 wherein said expression 
host is Corynebacterium .' 

4. The method of claim 1 wherein said nucleotide 
expression and regulation sequences include a 
promoter, further comprising selecting a 
promoter having a higher efficiency than the 
promoter associated with the enzyme encoding 
chromosomal genes and selecting for greater 
efficiency, 

5. The method of claim 4 wherein said higher 
efficiency promoter is obtained by mutating the 
promoter associated with said enzyme encoding 
genes. 
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6. The method of claim 4 further comprising 

selecting said high efficiency promoter from 
the group of promoters isolated from 
Escherchia , Bacillus , Staphylococcus and 
Streptococcus , 

?• The method of claim 1 further comprising 

selecting a multicopy plasmid as the expression 
vector. 

8. The method of claim 1 wherein said expression 
and regulation sequences include a ribosome 
binding site, further comprising selecting for 
a ribosome binding site with increased 
efficiency. 

9. The method of claim 1 further comprising 
combining said enzyme encoding nucleotide 
sequences in a single expression vector. 

10. The method of claim 1 further comprising 
mutating said enzyme encoding nucleotide 
sequences and selecting for temperature 
stability. 

11. The method of claim 1 further comprising 
mutating said enzyme encoding nucleotide 
sequences and selecting for substrate util- 
ization. 
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12. The method of claim 1 wherein said expression 
and regulation signals include a repressor, 
further comprising modifying said repressor. 

13. The method of claim 12 wherein said repressor 
is deleted. 

14. The method of claim 12 wherein said repressor 
is mutated- 

15. The method of claim 12 wherein said repressor 
is replaced with a repressor other than the 
repressor associated with the chromosomal gene 
encoding said sequence. 

16. A method of constructing a promoter probe for 
Corynebacterium amino acid genes comprising: 
isolating nucleotide sequences encoding a 
detectable protein product involved in 
threonine biosynthesis in a Corynebacterium 
host, 

constructing deletions of the 3 * end of said 
nucleotide sequences , 

inserting said deletions into an expression 
vector , 

trcuisf orming said vector into an auxotrophic 
Corynebacterium host, and 

determining if said deletions produce protein 
in auxotrophic Corynebacterium hosts. 
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17. The Tnethod of claim 16 further comprising 
sequencing the nucleotide sequences comple- 
menting the auxotrophic Corynebacterium hosts . 

18. A rDNA sequence comprising hom, thrB, and thrC. 

19 . The rDNA sequence of claim 18 further 
comprising a promoter sequence. 

20. The rDNA sequence of claim 19 further 
comprising a repressor sequence. 

21. The rDNA sequence of claim 19 wherein said 
promoter sequence is selected from sequences 
having a higher translational efficiency than 
the sequences associated with the chromosomal 
DNA. 

22. A Corynebacterium rDNA promoter sequence 
comprising a ribosome binding site TTGGAGA. 

23. A nucleotide sequence hybridizing to a rDNA 
sequence encoding homoserine dehydrogenase in 
Corynebacteria . 

24. A nucleotide sequence hybridizing to a rDNA 
sequence encoding homoserine kinase in Coryne- 
bacteria. 
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25 . A nucleotide, sequence hybridizing to a rDNA 
sequence encoding tiureonine synthase. 

26. A Corynebacterium rDNA translation termination 
sequence comprising 

AAGGAAGGCCCCTTCGAATCAAGA 
AGGGGCCTT. 

27. A Corynebacterium rDNA translation termination 
sequence comprising 

G.ATGGAACCAGGCCTTTCGCATTG 
AGTGGCGTTTTAAGGCCTCCA. 

28. A Corynebacterium rDNA sequence repressing 
translation in the presence of excess 
methionine comprising 

TTTGTTTTGGACAC A T G T T C TAG G 

met stop 

GTGGCCGAAACAAA. 
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FIGURE 3 



1 GTCGACCGCGTGAAGTCGCCCTTTAGGAGAATTCTGACTAACTGGAGCCAAAACrrTGATC 60 

6 1 CACTCGAGAGCTGTGCAGTCTCTTTTTCCTTCAATTCTGCCTGCTCGAGCTCGTAGAAGT 1 20 

121 AGAGGTCTACTTCAGTTGGTTCACCTTGCACACAAGCATGAAGTAGTGGGTAGGTCGAGT 180 

181 TGTTAAATGCGGTGTAGAAGGGGAGTAGTTCGCTAGCAAAGGTTAATTTGGAGTCGCTGT 240 

241 ACTGCGGGTTCTCGGGTGGAGTATTCCCGGAGGATTCAAGAAATCTTGACGCATCTTTGA 300 

301 TGAGGTATGTTTGGAATTCGTCGGCACCTTCCTCGCCGGAGAGGTAGTAGGAGTTGTCGT . 360 

361 AATTTGGAACCCAGATGGCAAATCGTGCGTTTTCGATTGCGTCCAGGACTTCCTCTACGT 420 

421 TGTATCTCGCACTTGTTGCAGCGGAAGCGACTCGGTTGCCGATGTCTCCGTATGCAGTGA 4 80 

481 GCGTGGCGTTTCCGAGGGGAACTTGATCAGAGGAATAC ACCATGGAGCCGATGTCAGAGG 540 

541 CGACTGCGGGCAGATCCTTTTGAAGCTGTTTCACAATTTCTTTGCCCAGTTCGCGC?CGGA ' 600 

601 TCTGGAACCACTTTTGCATGCGATCGTCGTCAGAGTGGTTCATGTGAAAAATACACTCAC 660 

661 CATCTCAATGGTCATGGTGAAGGCCTGTACTGGCTGCGACAGCATGGAACTCAGTGCAAT 720 

72 1 GGCTGTAAGGCCTGCACCAACAATGATTGAGCGAAGCTCCAAAATGTCCTCCCCGGGTTG 780 



wo 88/09819 



PCr/US88/02029 



4/16 

FIGURE 3 (CONT'D) 



781 ATATTAGATTTCATAAATATACTAAAAATCTTGAGAGTTrTTCCGTTGAAAACTAAAAAG 840 



841 CTGGGAAGGTGAATCGAATTTCGGGGCTTTAAAGCAAAAATGAACAGCTTGGTCTATAGT 900 



901 GGCTAGGTACCCTTTTTGTTTTGCACACATGTAGGGTGGCCGAAACAAAGTAATAFF 961 



HetThrSerAlaSerAlaProSerPhe 
961 ACAACGCTCGACCGCGATTATTTTTGGAGAATCATGACCTCAGCATCTGCCCCAAGCTTT 1020 

Translatloa Initiation Codon 



AsnProGlyLysGlyProGiySerAlaValGlylleAlaLeuLeuGlyPheGlyThrVal 
1021 AACCCCGGCAAGGGTCCCGGCTCAGCAGTCGGAATTGCCCTTTTAGGATTCGGAACAGTC 1080 



GlyThrGluValMetArgLeuMetThrGluTyrGlyAspGluLeiiAlaHlsArglleGly 
1081 GGCACTGAGGTGATGCGTCTGATGACCGAGTACGGTGATGAACTTGCGCACCGCATTGGT 1140 



GlyProteuGluValArgGlylleAlaValSerAspIleSerLysProArgGluGlyVai 
1141 GGCCCACTGGAGGTTCGTGGCATTGCTGTTTCTGATATCTCAAAGCCACGTGAAGGCGTT 1200 



AlaProGIuLeuLeuThrGlxiAspAlaPheAlaLeuIleGIiiArgGluAspValAspIle 
1201 GCACCTGAGCTGCTCACTGAGGACGCTTTTGCACTCATCGAGCGCGAGGATGTTGACATC 1260 



ValValGluVallleGlyGlylleGluTyrProArgGluValValLeuAlaAlaLeuLys 
1261 GTCGTTGAGGTTATCGGCGGCATTGAGTACCCACGTGAGGTAGTTCTCGCAGCTCTGAAG 1320 



AlaGlyLysSerValValThrAlaAsnLysAlaLeuValAlaAlaHlsSerAlaGluLeu 
1321 GCCGGCAAGTCTGTTGTTACCGCCAATAAGGCTCTTGTTGCAGCTCACTCTGCTGAGCTT 1380 



AlaAspAlaAlaGluAlaAlaAsnValAspLeuTyrPheGluAlaAlaValAlaGlyAla 
1381 GCTGATGCAGCGGAAGCCGCAAACGTTGACCTGTACTTCGAGGCTGCTGTTGCAGGCGCA 1440 
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FIGURE 3 (CONT'D) 



IleProValValGlyProLeuArgArgSerLeuAlaGlyAflpGlnlleGinSerValMet 
1441 AITCCAGTGGTTGGCCCACTGCGTCGCTCCCTGGCTGGCGATCAGATCCAGTCTGTGATG 1500 

GlylleValAsnGlyThrThrAsnPhelleLeuAspAlaMetAspSerThrGlyAlaAsp 
1501 GGCATCGTTAACGGCACCACCAACTTCATCTTGGACGCCATGGATTCCACCGGCGCTGAC 1560 

TyrAlaAspSerLeuAlaGluAlaThrArgLeuGlyTyrAlaGluAlaAspProThrAla 
1561 TATGCAGATTCTTTGGCTGAGGCAACTCGTTTGGGTTACGCCGAAGCTGATCCAACTGCA 1620 

AspValGluGlyHlsAspAlaAla SerLysAlaAla lleLeuAlaSerlleAlaPleHla 
1621 GACGTCGAAGGCCATGACGCCGCATCCAAGGCTGCAATTTTGGCATCCATCGCTCT 1680 

ThrArgValThrAlaAspAspValTyrCysGluGlylleSerAsnlleSerAlaAlaAsp 
1681 ACCCGTGTTACCGCGGATGATGTGTACTGCGAAGGTATCAGCAACATCAGCGCTGCCGAC 1740 

IleGluAlaAlaGlnGlnAlaGlyHisThrlleLysLeuLeuAlalleCysGluLysPhe 
1741 ATTGAGGCAGCACAGCAGGCAGGCCACACCATCAAGTTGTTGGCCATCTGTGAGAAGTTC 1800 

ThrAsnLysGluGlyLysSerAlalleSerAlaArgValHiaProThrLeuLeuProVal 
1801 ACCAACAAGGAAGGAAAGTCGGCTATTTCTGCTCGCGTGCACCCGACTCTATTACCTGTG 1860 

SerHlsProLeuAlaSerValAsnLysSerPheAsnAlallePheValGluAlaGluAla 
1861 TCCCACCCACTGGCGTCGGTAAACAAGTCCTTTAATGCAATCTTTGTTGAAGCAGAAGCA 1920 

AlaGlyArgLeuMetPheTyrGlyAsnGlyAlaGlyGlyAlaProThrAlaSerAlaVal 
1921 GCTGGTCGCCTGATGTTCTACGGAAACGGTGCAGGTGGCGCGCCAACCGCGTCTGCTGIC 1980 

LeuGlyAspValValGlyAlaAlaArgAsnLysValHlsGlyGlyArgAlaProGlyGlu 
1981 CTTGGCGACGTCGTTGGTGCCGCACGAAACAAGGTGCACGGTGGCCCTGCTCCAGGTGAG 2040 
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FIGURE 3 (CONT'D) 



SerThrTyrAlaAsnLeuProIleAlaAspPheGlyGluThxThrThrArgTyrHisLeu 
204 1 TCCACCTACGCTAACCTGCCGATCGCTGATTTCG ,TGAGACCACCACTCGTTACCACCTC 2100 

AspMetAspValGluAspArgValGlyValLeuAlaGluLeuAlaSerLeuFheSerGlu 
2101 GACATGGATGTGGAAGATCGCGTGGGGGTTTTGGCTGAATTGGCTAGCCTGTTCTCTGAG 2160 



GInGlylleSerLeuArgThrlleArgGlnGluGluArgAspAspAspAlaArgLeuIle 
2161 CAAGGAATCTCCCTGCGTACAATCCGACAGGAAGAGCGCGATGATGATGCACGTCTGATC 2220 



ValValThrHisSerAlaLeuGluSerAspLeuSerArgThrValGluLeuLeuLysAla 
2221 GTGGTCACCCACTCTGCGCTGGAATCTGATCTTTCCCGCACCGTTGAACTGCTGAAGGCT 2280 



LysProValValLysAlalleAsnSerVallleArgLeuGluArgAsp 
2281 AAGCCTGTTGTTAAGGCAATCAACAGTGTGATCCGCCTCGAAAGGGACTAATTTTA 2340 

Stop 



Predicated start of • thrB translation 

MetAlalleGlixLeuAsnValGlyArgLysValThrVaiThrValProGlySerSerAl 
2341 CATGGCAATTGAACTGAACGTCGGTCGTAAGGTTACCGTCACGGTACCTGGATCTTCTGC 2400 
Translation initiation Codon 



aAsnLeuGlyProGlyPheAspThrLeuGlyLeuAlaLeuSerValTyrAapThrValGl 
2401 AAACCTCGGACCTGGCTTTGACACTTTAGGTTTGGCACTGTCGGTATACGACACTGTCGA 2460 



uValGluIlelleProSerGlyLeuGluValGluValPheGlyGluGlyGlnGlyGluVa 
2461 AGTGGAAATTATTCCATCTGGCTTGGAAGTGGAAGTTTTTGGCGAAGGCCAAGGCGAAGT 2520 



LProLeuAspGlySerHisLeuValValLysAlalleArgAlaGlyLeiiLysAlaAlaAs 
2521 CCCTCTTGATGGCTCCCACCTGGTGGTTAAAGCTATTCGTGCTGGCCTGAAGGCAGCTGA 2580 



pAlaGluValProGlyLeuArgValValCysHlsAsnAsnlleProGlnSerArgGlyLe 
2581 CGCTGAAGTTCCTGGATTGCGAGTGGTGTGCCACAACAACATTCCGCAGTCTCGTGGTCT 2640 
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FIGURE 3 (CONT'D) 



OGlySerSerAlaAlaAlaAlaValAlaGlyVaLAlaAlaAlaAsnGlyLeuAlaAspPh 
2641 TGGCTCCTCTGCTGCAGCGGCGGTTGCTGGTGTTGCTGCAGCTAATGGTTTGGCGGATTT 2700 

EProLeuThrGlnGluGlnlleValGlnLeuSerSerAlaPheGluGlyHisProAspAs 
2701 CCCGCTGACTCAAGAGCAGATTGTTCAGTTGTCCTCTGCCTTTGAAGGCCACCCAGAIAA 2760 

nAlaAlaAlaSerValLeuGlyGlyAlaValValSerTrpThrAsnLeuSerlleAspGl 
2761 TGCTGCGGCTTCTGTGCTGGGTGGAGCAGTGGTGTCGTGGACAAATCTGTCTATCGACGG 2820 

yLysSerGlnProGlnTyxAlaAlaValProLeuGluVaiGlnAspAsnlleArgAlaTh 
2821 CAAGAGCCAGCCACAGTATGCTGCTGTACCACTTGAGGTGCAGGACAATATTCGTGCGAC 2880 

rAlaLeuVaiProAsnPheHisAlaSerThrGluAlaVaiArgArgValLeuProThrGi 
2881 TGCGCTGGTTCCTAATTTCCACGCATCCACCGAAGCTGTGCGCCGAGTCCTTCCCACTGA 2940 

uValThrHisIleAspAlaArgPheAsnValSerArgValAlaVaLMetlleValAlaLe 
2941 AGTCACTCACATCGATGCGCGATTTAACGTGTCCCGCGTTGCAGTGATGATCGTTGCGTT 3000 

uGlnGlnArgProAspLeuLeuTrpGluGlyThrArgAspArgLeuHlsGlnProTyrAr 
3001 GCAGCAGCGTCCTGATTTGCTGTGGGAGGGTACTCGTGACCGTCTGCACCAGCCTTATCG 3060 

gAlaGluValLeuProIleThrSerGluTrpValAsnArgLeuArgAsnArgGlyTyrAl 
3061 TGCAGAAGTGTTGCCTATTACCTCTGAGTGGGTAAACCGCCTGCGCAACCGTGGCTACGC 3120 

aAlaTyrLeuSerGlyAlaGlyProThrAlaMetValLeuSerThrGluProIleProAs 
3121 GGCATACCTTTCCGGTGCCGGCCCAACCGCCATGGTGCTGTCCACTGAGCCAATTCCAGA 3180 

pLysValLeuGluAspAlaArgGluSerGlylleLysValLeuGltxLeuGluValAlaGl 
3181 CAAGGTTTTGGAAGATGCTCGTGAGTCTGGCATTAAGGTGCTTGAGCTTGAGGTTGCGGC 3240 
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FIGURE 3 (CONT'D) 

yProValLysValGluValAsnGlnPro 
3241 ACCAGTCAAGGTTGAAGTTAACCAACCTTAGGCCCAACAAGGAAGGCCCCTTCGAATCAA 3300 

Stop Computer predicted 

3301 GAAGGGGCCTTATTAGTGCAGCAATTATTCGCTGAACACGTGAACCTTACAGGTGCCCGG 3360 
translation interminatlon point 

3361 CGCGTTGAGTGGTTTGAGTTCCAGCTGGATGCGGTTGTTTTCACCGAGGCTTTCTTGGAT 3420 

3421 GAATCCGGCGTGGATGGCGCAGACGAAGGCTGATGGGCGTTTGTCGTTGACC^CAAATGG 3480 

3481 GCAGCTGTGTAGAGCGAGGGAGTTTGGTTCTTCGGTTTCGGTGGGGTCAAAGCCCATTTC 3540 

3541 GCGGAGGCGGTTAATGAGCGGGGAGAGGGCTTCGTCGAGTTCTTCGGCTTCGGCGTGGTT 3600 

360 1 AATGCCCATGACGTGTGCCCACTGGGTTCCGATGGAAAGTGCTTTGGCGCGGAGGTCGGG 3660 

3661 GTTGTTGCATTGCGTCATCGTCGAC 3685 
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FIGURE 4 

1 TGACCGAGAGTTTTTTTGAGCAACTGGATCATTAGATAATTGCT^ 60 

61 ATCACCCGTTATGGAGACCTACTGGAATTGAGCCCAGAAACCGTCGATGTGTGCCTCAAC 120 

121 GTAGGGGTAAAGCCACGGCCCGAGCAGCACCAGGCCGACCGCGAGCACCGAACAACCAAT 1 80 

181 GAGAACATACAGGTTCCACTTGGACACCGGCGCTGGATTAAGGATTTCAACTGCGGTGAG 240 

241 ATTCTTCTTGTTGTTGTCCTCGAGTTTCGAGAAGCTGGGGTAATCGGGAGCTGTCATCTT 300 

301 XAAAGCACATCCTAAAACCGACAATTGAAAGTGATCAGCAACACTTTAGGGTATCGCGTG 360 

Predicted start of thrC translation 

ValGlyGluTyxCysValThrProT 
361 GGCGAAGTCACCTTTTTCAACATATTTGAGACGGTGTGGGGGAGTAXTGTGTCACCCCTT 420 

rlbosome binding sequence 

rpUeGlyLeuTyrProTrpThrThrPheArgProArgAspAlaSerArgXhrProAlaA 
42 1 GGATAGGGTTATATCCGTGGACTACATTTCGACCGCGTGATGCCAGCCGTACCCCTGCCC 480 

rgPheSerAspIleLeuLeuGlyGlyLeuAlaProAspGlyGlyLeuTyrLeuProAlaT 
481 GCTTCAGTGATATTTTGCTGGGCGGTCTAGCACCAGACGGCGGCCTGTACCTGCCTGCAA 540 
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FIGURE 4 (CONT'D) 



hrTyrProGlnLeuAspAspAlaGlnLeuSerLysXrpArgGluValLeuAlaAstiGluG 
5A 1 CCTACCCTCAACTAGATGATGCCCAGCTGAGTAAATGGCGTGAGGTATTAGCCAACGAAG 600 



lyTyrAlaAlaLeuAlaArgGluVallleSerLeuPheValAspAspIleProValGluA 
60 1 GATACGCAGCTTTGGCTCGTGAAGTIATCTCCCTGTTTGTTGATGACATCCCAGTAGAAG 660 



spIleLysAlalleThrAlaArgAlaTyrThrXyrProLyaPheAsiiSerGluAspIleV 
66 1 ACATCAAGGCGATCACCGCACGCGCCTACACCTACCCGAAGTTCAACAGCGAAGACATCG 720 



alProValXhrGiuLeuGluAspAsiirieTyrLeuGlyHlsLeuSe^ 
721 TTCCTGTCACCGAACTCGAGGACAACATTTACCTGGGCCACCTTTCCGAACCCGCAACCG 780 



laAlaPheLysAspMetAlaMetGlnLeuLeuGlyGluLeuPheGluTyrGliiLeuArgA 
781 CTGCATTCAAAGACATGGCCATGCAGCTGCTCGGCGAACTTTTCGAATACGAGCTTCGCC 840 



rgA^^gAsnGluThrlleAsnlleLeuGlyAlaThrSerGlyAspThrGlySerSerAlaG 
84 1 GCCGCAACGAAACCATCAACATCCTGGGCGCTACCTCTGGCGATACCGGCTCCTCTGCGG 900 



luTyrAlaMetArgGlyArgGluGlylleArgValPheMetLeuThrProAlaGlyArgM 
901 AATACGCCATGCGCGGCCGCGAGGGAATCCGCGTATTCATGCTGACCCCAGCTGGCCGCA 960 



etXhrProPheGlnGlnAlaGlnMetPheGlyLeuAspAspProAsnllePheAsnlleA 
961 TGACCCCATTCCAGCAAGCACAGATGTTTGGCCTTGACGATCCAAACATCTTCAACATCG 1020 



laLeuAspGlyValPheAspAspCysGlnAspValValLysAlaValSerAlaAspAlaG 
102 1 CCCTCGACGGCGTTTTCGACGATTGCCAAGACGTAGTCAAGGCTGTCTCCGCCGACGCAG 1080 



luPheLysLysAspAsnArglleGlyAlaValAsnSerlleAsnTrpAlaArgLeuMetA 
1081 AATTCAAAAAAGACAACCGCATCGGTGCCGTGAACTCCATCAACTGGGCACGCCTTATGG 1140 
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FIGURE 4 (CONT'D) 



laGlnValValTyrTyrValSerSerTrpIleArgThrThrThrSerAsnAspGlnLysV 
1141 CACAGGTTGTGTACTACGTTTCCTCATGGATCCGCACCACAACCAGCAATGACCAAAAGG 1230 



alSerPheSerValProThrGlyAsnPheGlyAspIleCysAlaGlyHialleAlaArgG 
1 20 1 TCAGCTTCTCCGTACCAACCGGCAACTTCGGTGACATTTGCGCAGGCCACATCGCCCGCC 1260 



InMetGlyLeuProIleAspArgLeuIleValAlaThrAsnGltiAsnAspValLeuAspG 
1261 AAATGGGACTTCCCATCGATCGCCTCATCGTGGCCACCAACGAAAACGATGTGCTCGACG 1320 



luPhePheArgThrGlyAspTyrArgValArgSerSerAiaAspThrHlsGlufhrSerS 
1 32 1 AGTTCTTCCGTACCGGCGACTACCGAGTCCGCAGCTCCGCAGACACCCACGAGACCTCCT 1380 



erProSerMetAspIleSerArgAlaSerAsnPheGluArgPhellePheAspLeuLeuG 
1381 CACCTTCGATGGATATCTCCCGCGCCTCCAACTTCGAGCGTTTCATCTTCGACCTGCTCG 1440 



lyArgAspAlaXhrArgValAsnAspLeuPheGlyThrGlnValArgGlnGlyGlyPheS 
144 1 GCCGCGACGCCACCCGCGTCAACGATCTATTTGGTACCCAGGTTCGCCAAGGCGGATTCT 1500 



erLeuAlaAspAspAlaAsnPheGluLysAlaAlaAlaGluTyrGlyPheAlaSerGlyA 
1 50 1 CACTGGCTGATGACGCCAACTTTGAGAAGGCTGCAGCAGAATACGGTTTCGCCTCCGGAC 1560 



rgSerThrHisAlaAspArgValAlaXhrlleAlaAspValHisSerArgLeuAspValL 
156 1 GATCCACCCATGCTGACCGTGTGGCAACCATCGCTGACGTGCATTCCCGCCTCGACGTAC 1620 



eulleAspProHisThrAlaAspGlyValHlsValAlaArgGlnTrpArgAspGluValA 
1621 TAATCGATCCCCACACCGCCGACGGCGTTCACGTGGCACGCCAGTGGAGGGACGAGGTCA 168Q 



snThrProIlelleValLeuGluThrAlaLeuProValLysPheAlaAspThrlleValG 
1681 ACACCCCAATCATCGTCCTAGAAACTGCACTCCCAGTGAAATTTGCCGACACCATCGTCG 1740 
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FIGURE 4 (CONT'D) 

luAlalleGlyGluAlaProGlnThrProGluArgPheAlaAlalleMetAspAlaProP 
1741 MG(yU^TGGTGAAGCACCTCAAACTCCAGAGCGTTTCGCCGCGATCATGGATC ^1800 

heLysValSerAspLeuProAsnAspThrAspAlaValLysGlnXyrlleValAspAlal 
1801 TCAAGGTTTCCGACCTACCAAACGACACCGATGCAGTTAAGCAGTACATAGTCGATGCGA 1860 

leAlaAsnThrSerValLys 
1861 TTGCAAAC ACTTCCGTGAAGTAACTTGCTTTACGCCAAGGCCTGATTCCTCTCTTTATG^ 1920 

1921 GATGGAACCAGGCCTTTCGCATTGAGTGGCGTTTTAAGGCCTCCAAT^ 1980 
Computer predicted terminator structure, *+termination point 

1981 GTTTGACATGGAGGGGTCACAGTCAAGCCGTTAGAAGCGATTCTGGGAGG^ 2040 

2041 CGGAGTTGGAGGTCGAATTTCCGCTGAACTGATGGGAACCAGACAGGCGTGACAAGATTG 2100 

2101 GCTAAAAACCTGAAGTTTTGTCACGCCTGTCTGGTTTCCCTCTTGTCGGTGCGAGCGAGT 2 160 

2161 CCCTTGAACGACACAGATCGCGCCAAATGGAAGTGTCTGCGACCCCAGAATATTTGATTC 2220 

2221 CCCGGTCCGAGTCGTGCGAAAAATGCTCTGGTTAGTCCTCGATCATCGCAATCGCATCAA 2280 

2281 TTTCCACAGTTGCACCATAAGGAAGCGATGATGCACCCACGAAAGAGCGTGCCGGGCGGC 2340 

2341 CTTCGAGGAAATGCTCTCGGAATTGCTCGTTGCATTCTTCGCGCAGGCTGATGTCGGTGA 2400 

240 1 - -CAAAGTAAGTGAGTTTCACAACGTCTTTGAGTTCACCACCAGCGGTCGGAGGCGTTCACG 2460 

2461 CATGCGTTCAAGTGCTGCATCAACTGCTTCTTTACGACCGACGACTGGTTGGTAGTCCTT 2520 
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FIGURE 4 (CONT'D) 
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252 1 GTCTACTGAAAGAGCGCCGGAGACGAAGATGAAATTTCCGACGCGTTTTGCGGGGGACTT 2580 

2581 ATGGGTGATCATTCGACATGTGGCCAACCATAGCTGTTTCCCCGAAGAGAGTGCCGGAAC 2640 

264 1 AGGCATTTTAGAGGTGGGGGAGCACTTCTTCGTAAATCTGGGTCAGTACTTCGCTTGCTG 2700 

2701 GTCGGCGCTGGATGTTGAAGATGACGTGGTCGATGCCAAGTTCCGAAAGCGGTGGAGGTC 2760 

2761 TTGGAa^GAGTTCGTGGGTGCGTACCTCTACGCCAGAGTGATTTCTTTGTGGGTGTTTCC^ 2820 

282 1 TCGGTGAGGTTGAGCCCCATGGAGGAAATCAACAAGGGGCGGGTGCCACCACGGGCTTTG 2 880 

288 1 TCCCAGAGATCGAGGCGTCCGACTTGAGCTTCAGCGGGGCGGTAGTAGGTTGCCCATCCG 2940 

2941 TCGGCGTTTCGGGCGATCCATTGCACTGTTTGTCGGGCAGAACCTACAGCGATCATGGGG 3000 

3001 ATCTGAGCTTCAGGTGGCGTGGTTGGCGCAAATTCAAGGTCGGCCCGCATCGCAGGATCC 3060 

3061 TTCGACAAAGCTGCACGCAAAATTGCCCACCCAGACTGAATATCAGCGCGTCGATTGTCT 3120 

3121 AAGCTTTTCGGAAAAATCTCGAATTC 3146 
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WCGGGTTGATATTAGATTTCATAAATATACTAAAAATCTTGAGAGTTTTTCCGTTGAAA 
66GCCCAACTATAATCTAAA6TATTTAIATGATTTTTAGAACTCTCAAAAAGGCAACTTT 

Dral 
-35 

ACrrAAAAAGCTGGGAAGGTGAATCGAATTTCGGGGCTTIAAAGCAAAAATGAACAGCTTG 
TGATTTTTCGACCCTTCCACTTAGCTTAAAGCCCCGAAATTTCGTTTTTACTTGTCGAAC 
mRNA start hyphenated dyad sjnnmetry 

-10 . _ * — « I \Met*** . 

GTCTATAGTCSSCTAiSCTACCCTTTrTGTTTTGGAC^^ 

CAGATATCACCGATCCATGGGAAAAACAAAACCTGTGTACATCCCACCGGCTTTGTTTCA 

predicted start of thr A translation 

ThrSerAlaSerAla 

AATAGGACAACAACGCTaSACCGCGATTATTTTTGGAGAATCATGACCTCAGCATCTGCC 
TTATCCTGTTGTTGCGAGCTGGCGCTAATAAAAACCTCTTAGTACTGGAGTCGTAGACGG 
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